Feature selection with ensemble learning for prostate cancer diagnosis from microarray gene expression
نویسندگان
چکیده
Cancer diagnosis using machine learning algorithms is one of the main topics research in computer-based medical science. Prostate cancer considered reasons that are leading to deaths worldwide. Data analysis gene expression from microarray and soft computing a useful tool for detecting prostate diagnosis. Even though traditional methods have been successfully applied cancer, large number attributes with small sample size data still challenge limits their ability effective Selecting subset relevant features all choosing an appropriate method can exploit information improve accuracy rate detection. In this paper, we propose use correlation feature selection (CFS) random committee (RC) ensemble detect expression. A set experiments conducted on public benchmark dataset 10-fold cross-validation technique evaluate proposed approach. The experimental results revealed approach attains 95.098% accuracy, which higher than related work same dataset.
منابع مشابه
Feature Selection for Cancer Classification Using Microarray Gene Expression Data
The DNA microarray technology enables us to measure the expression levels of thousands of genes simultaneously, providing great chance for cancer diagnosis and prognosis. The number of genes often exceeds tens of thousands, whereas the number of subjects available is often no more than a hundred. Therefore, it is necessary and important to perform gene selection for classification purpose. A go...
متن کاملGene Identification from Microarray Data for Diagnosis of Acute Myeloid and Lymphoblastic Leukemia Using a Sparse Gene Selection Method
Background: Microarray experiments can simultaneously determine the expression of thousands of genes. Identification of potential genes from microarray data for diagnosis of cancer is important. This study aimed to identify genes for the diagnosis of acute myeloid and lymphoblastic leukemia using a sparse feature selection method. Materials and Methods: In this descriptive study, the expressio...
متن کاملFeature Selection Combined with Random Subspace Ensemble for Gene Expression Based Diagnosis of Malignancies
The bio-molecular diagnosis of malignancies represents a difficult learning task, because of the high dimensionality and low cardinality of the data. Many supervised learning techniques, among them support vector machines, have been experimented, using also feature selection methods to reduce the dimensionality of the data. In alternative to feature selection methods, we proposed to apply rando...
متن کاملMinimum Redundancy Feature Selection from Microarray Gene Expression Data
How to selecting a small subset out of the thousands of genes in microarray data is important for accurate classification of phenotypes. Widely used methods typically rank genes according to their differential expressions among phenotypes and pick the top-ranked genes. We observe that feature sets so obtained have certain redundancy and study methods to minimize it. We propose a minimum redunda...
متن کاملDiagnosis of Breast Cancer Subtypes using the Selection of Effective Genes from Microarray Data
Introduction: Early diagnosis of breast cancer and the identification of effective genes are important issues in the treatment and survival of the patients. Gene expression data obtained using DNA microarray in combination with machine learning algorithms can provide new and intelligent methods for diagnosis of breast cancer. Methods: Data on the expression of 9216 genes from 84 patients across...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Health Informatics Journal
سال: 2021
ISSN: ['1741-2811', '1460-4582']
DOI: https://doi.org/10.1177/1460458221989402